smiley face
emoji-development-face-tears-joy-book-keith-houston.html?via=rss
A couple of years ago, I frequently found myself driving past a roadside ice cream stand under construction. For weeks, the roof of this stand, a gigantic white swirl of fiberglass soft serve, sat on the ground next to the structure, waiting to be lowered onto the finished, cone-shaped building with a crane. I know what it was supposed to represent, but every time I glimpsed it, my instinctive first thought was There's a giant poop emoji. Keith Houston's history of emoji, Face With Tears of Joy, argues that emoji have "become so ubiquitous in our writing, so quotidian, that we should be talking about them in the same breath as grammar or punctuation." I don't know about grammar, which seems as fundamental to language, spoken and written, as words themselves.
'Synthetic human' robot twitches and spams into LIFE in terrifying new video
While we may be impressed by their artificial intelligence, humanoids often have an awkward, clunky gait. Now, experts have developed a robot with astonishingly lifelike movements โ thanks to synthetic muscles beneath translucent skin. Polish startup Clone Robotics has shared a terrifying new clip of Protoclone, its'faceless, anatomically accurate synthetic human'. Like something from the Terminator movies, the 6-foot prototype machine hangs from the ceiling in the company's secretive development workshop. As ominous music plays, Protoclone twitches its limbs back and forth with its head bowed, like a puppet brought to life in a mad scientist's lab.
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Jin, Chuhao, Tan, Wenhui, Yang, Jiange, Liu, Bei, Song, Ruihua, Wang, Limin, Fu, Jianlong
We propose a novel framework for learning high-level cognitive capabilities in robot manipulation tasks, such as making a smiley face using building blocks. These tasks often involve complex multi-step reasoning, presenting significant challenges due to the limited paired data connecting human instructions (e.g., making a smiley face) and robot actions (e.g., end-effector movement). Existing approaches relieve this challenge by adopting an open-loop paradigm decomposing high-level instructions into simple sub-task plans, and executing them step-by-step using low-level control models. However, these approaches are short of instant observations in multi-step reasoning, leading to sub-optimal results. To address this issue, we propose to automatically collect a cognitive robot dataset by Large Language Models (LLMs). The resulting dataset AlphaBlock consists of 35 comprehensive high-level tasks of multi-step text plans and paired observation sequences. To enable efficient data acquisition, we employ elaborated multi-round prompt designs that effectively reduce the burden of extensive human involvement. We further propose a closed-loop multi-modal embodied planning model that autoregressively generates plans by taking image observations as input. To facilitate effective learning, we leverage MiniGPT-4 with a frozen visual encoder and LLM, and finetune additional vision adapter and Q-former to enable fine-grained spatial perception for manipulation tasks. We conduct experiments to verify the superiority over existing open and closed-loop methods, and achieve a significant increase in success rate by 21.4% and 14.5% over ChatGPT and GPT-4 based robot tasks. Real-world demos are shown in https://www.youtube.com/watch?v=ayAzID1_qQk .
'I want to destroy whatever I want': Bing's AI chatbot unsettles US reporter
In the race to perfect the first major artificial intelligence-powered search engine, concerns over accuracy and the proliferation of misinformation have so far taken centre stage. But a two-hour conversation between a reporter and a chatbot has revealed an unsettling side to one of the most widely lauded systems โ and raised new concerns about what AI is actually capable of. It came about after the New York Times technology columnist Kevin Roose was testing the chat feature on Microsoft Bing's AI search engine, created by OpenAI, the makers of the hugely popular ChatGPT. The chat feature is currently only available to a small number of users who are testing the system. While admitting that he pushed Microsoft's AI "out of its comfort zone" in a way most users would not, Roose's conversation quickly took a bizarre and occasionally disturbing turn.
Beginner's Guide to Diffusion Models
Recently, there has been an increased interest in OpenAI's DALL-E, Stable Diffusion (the free alternative of DALL-E), and Midjourney (hosted on a Discord server). While AI-generated art is very cool, what is even more captivating is how it works in the first place. In the last section, I will include some resources for anyone to get started in this AI art space as well. So how do these technologies work? It uses something called a latent diffusion model, and the idea behind it is actually ingenious.
Hitting the Books: Robots came for our jobs, then they came for our coffee
We have no chance of escaping the coming robot revolution, nor should we want to. Our modern lives are already full of robots -- they're in our phones, our cars, hospitals and boardrooms, assisting everyone from factory workers to astrophysicists. They make our lives overwhelmingly better -- that is, until one gets between a hungover human and their morning jolt of java. In Talking to Robots, journalist and author David Ewing Duncan -- with help from some of today's leading scientific researchers -- presents 24 visions of the future and what our personal and professional interactions might look like once robots finish taking over. Need my hit of caffeine.
Smiley face dot com: GoDaddy releases EMOJI search engine and domain name registration
In May last year, emoji was named as the world's fastest growing language. In May last year, emoji was named as the world's fastest growing language. IS EMOJI THE FASTEST GROWING LANGUAGE? 'Most people have no idea they can just type a bunch of hearts in their address bar and go to a domain,' the company said. It has been possible to register domain names made up of emojis for a while now.
Teaching AI how to be sarcastic is totally the easiest thing ever
Pop culture has primed us to think that our robot friends of the future will be quick-witted quipsters. Television and films have their fair share of sardonic androids, from the foul-mouthed alcoholic Bender in Futurama to the acerbic J.A.R.V.I.S. from Ironman. But the reality of such amusing sidekicks remains far off. Cognitively speaking, sarcasm is one of the most complex forms of human expression--and it's therefore one of the hardest to teach AI systems. While voice recognition, machine translation, and other such tools are constantly improving, AI still lacks the ability to detect this uniquely human linguistic trait in either verbal or written conversation.
Diamonds From the Rough: Improving Drawing, Painting, and Singing via Crowdsourcing
Gingold, Yotam (Rutgers University and Columbia University) | Vouga, Etienne (Columbia University) | Grinspun, Eitan (Columbia University) | Hirsh, Haym (Rutgers University)
It is well established that in certain domains, noisy inputs can be reliablycombined to obtain a better answer than any individual.It is now possible to consider the crowdsourcing of physical actions,commonly used for creative expressions such as drawing, shading, and singing.We provide algorithms for converting low-quality inputobtained from the physical actions of a crowd into high-quality output.The inputs take the form of line drawings, shaded images, and songs.We investigate single-individual crowds (multiple inputs from a single human)and multiple-individual crowds.